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AMENDED CLAIMS 

Claims 1-16 - cancelled 

17. (New) A method of selecting one or more variables for use with 
a statistical model, the method comprising the steps of: 

creating a plurality of unique subsets of variables of multivariate 

data; 

determining the performance of a discriminant rule when used with 
each of the subsets, the discriminant rule being based on multivariate 
normal class densities each having substantially diagonal covariance 
matrices; and 

selecting the one or more variables from at least one of the subsets 
that result in a desired performance of the discriminant rule. 

18. (New) The method as claimed in claim 17, wherein the step of 
creating the plurality of unique subsets comprises the step of identifying a 
variable in the multivariate data that is not a member of a set of variables, 
and adding the identified variable to the set. 
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1 9. (New) The method as claimed in claim 1 7, wherein the step of 
determining the performance of the discriminant rule comprises assessing a 
prediction error rate of the discriminant rule. 

20. (New) The method as claimed in claim 1 9, wherein 
the prediction error rate is a cross-validated error rate. 

21 . (New) The method as claimed in claim 1 7, wherein the desired 
performance of the discriminant rule comprises the lowest possible 
prediction error rate of the discriminant rule. 

22. (New) The method as claimed in claim 1 7, wherein the 
multivariate data comprises gene expression data. 

23. (New) Computer software which, when executed by a computer, 
enables the computer to carry out the method as claimed in claim 17. 

24. (New) A computer storage medium comprising the software as 
claimed in claim 23. 

25. (New) A statistical model for predicting a class of an 
observation, wherein the model includes one or more variables that have 
been selected using the method defined in claim 17. 

26. (New) An apparatus for selecting one or more variables for use 
with a statistical model, the system comprising: 

data creating means arranged to create a plurality of unique subsets 
of variables of multivariate data; 

a processing means arranged to determine the performance of a 
discriminant rule when used with each of the subsets, the discriminant rule 
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being based on multivariate normal class densities each having substantially 
diagonal covariance matrices; and 

a selecting means arranged to select the one or more variables from 
at least one of the subsets that results in a desired performance of the 
discriminant rule. 

27. (New) The apparatus as claimed in claim 26, wherein the data 
creating means is arranged to create the plurality of unique subsets by 
identifying a variable in the multivariate data that is not a member of a set 
of variables, and adding the identified variable to the set. 

28. (New) The apparatus as claimed in claim 26, wherein the 
determining means is arranged to determine the performance of the 
discriminant rule by assessing a prediction error rate of the discriminant 
rule. 

29. (New) The apparatus as claimed in claim 28, wherein the 
prediction error rate is a cross-validated error rate. 

30. (New) The apparatus as claimed in claim 26, wherein the desired 
performance of the discriminant rule comprises the lowest possible 
prediction error rate of the discriminant rule. 

31 . (New) The apparatus as claimed in claim 26, wherein the 
multivariate data comprises gene expression data. 

32. (New) The apparatus as claimed in claim 26, wherein the data 
creating means, processing means and selecting means are in the form of a 
computer running software. 



